Overview
Brought to you by YData
Dataset statistics
| Number of variables | 32 |
|---|---|
| Number of observations | 66448 |
| Missing cells | 73318 |
| Missing cells (%) | 3.4% |
| Duplicate rows | 4394 |
| Duplicate rows (%) | 6.6% |
| Total size in memory | 16.2 MiB |
| Average record size in memory | 256.0 B |
Variable types
| Categorical | 16 |
|---|---|
| Numeric | 14 |
| Text | 1 |
| DateTime | 1 |
| Dataset has 4394 (6.6%) duplicate rows | Duplicates |
agent is highly overall correlated with company and 1 other fields | High correlation |
arrival_date_month is highly overall correlated with arrival_date_week_number | High correlation |
arrival_date_week_number is highly overall correlated with arrival_date_month | High correlation |
assigned_room_type is highly overall correlated with reserved_room_type | High correlation |
company is highly overall correlated with agent | High correlation |
distribution_channel is highly overall correlated with market_segment | High correlation |
hotel is highly overall correlated with agent | High correlation |
is_canceled is highly overall correlated with reservation_status | High correlation |
market_segment is highly overall correlated with distribution_channel | High correlation |
reservation_status is highly overall correlated with is_canceled | High correlation |
reserved_room_type is highly overall correlated with assigned_room_type | High correlation |
children is highly imbalanced (80.0%) | Imbalance |
babies is highly imbalanced (96.1%) | Imbalance |
meal is highly imbalanced (53.1%) | Imbalance |
distribution_channel is highly imbalanced (61.5%) | Imbalance |
is_repeated_guest is highly imbalanced (82.2%) | Imbalance |
reserved_room_type is highly imbalanced (53.0%) | Imbalance |
deposit_type is highly imbalanced (62.5%) | Imbalance |
customer_type is highly imbalanced (50.3%) | Imbalance |
required_car_parking_spaces is highly imbalanced (81.7%) | Imbalance |
agent has 10124 (15.2%) missing values | Missing |
company has 62703 (94.4%) missing values | Missing |
adults is highly skewed (γ1 = 27.11728186) | Skewed |
previous_cancellations is highly skewed (γ1 = 22.47111231) | Skewed |
lead_time has 3749 (5.6%) zeros | Zeros |
stays_in_weekend_nights has 27125 (40.8%) zeros | Zeros |
stays_in_week_nights has 3977 (6.0%) zeros | Zeros |
previous_cancellations has 65353 (98.4%) zeros | Zeros |
previous_bookings_not_canceled has 64416 (96.9%) zeros | Zeros |
booking_changes has 56399 (84.9%) zeros | Zeros |
days_in_waiting_list has 63984 (96.3%) zeros | Zeros |
adr has 975 (1.5%) zeros | Zeros |
total_of_special_requests has 42616 (64.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-11-12 16:08:41.691013 |
|---|---|
| Analysis finished | 2024-11-12 16:09:55.723122 |
| Duration | 1 minute and 14.03 seconds |
| Software version | ydata-profiling vv4.12.0 |
| Download configuration | config.json |
Variables
hotel
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| Resort Hotel | |
|---|---|
| City Hotel |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 11.205755 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Resort Hotel |
|---|---|
| 2nd row | Resort Hotel |
| 3rd row | Resort Hotel |
| 4th row | Resort Hotel |
| 5th row | Resort Hotel |
Common Values
| Value | Count | Frequency (%) |
| Resort Hotel | 40060 | |
| City Hotel | 26388 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| hotel | 66448 | |
| resort | 40060 | |
| city | 26388 | 19.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 132896 | |
| e | 106508 | |
| o | 106508 | |
| 66448 | ||
| H | 66448 | |
| l | 66448 | |
| R | 40060 | 5.4% |
| s | 40060 | 5.4% |
| r | 40060 | 5.4% |
| C | 26388 | 3.5% |
| Other values (2) | 52776 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 744600 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| t | 132896 | |
| e | 106508 | |
| o | 106508 | |
| 66448 | ||
| H | 66448 | |
| l | 66448 | |
| R | 40060 | 5.4% |
| s | 40060 | 5.4% |
| r | 40060 | 5.4% |
| C | 26388 | 3.5% |
| Other values (2) | 52776 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 744600 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| t | 132896 | |
| e | 106508 | |
| o | 106508 | |
| 66448 | ||
| H | 66448 | |
| l | 66448 | |
| R | 40060 | 5.4% |
| s | 40060 | 5.4% |
| r | 40060 | 5.4% |
| C | 26388 | 3.5% |
| Other values (2) | 52776 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 744600 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| t | 132896 | |
| e | 106508 | |
| o | 106508 | |
| 66448 | ||
| H | 66448 | |
| l | 66448 | |
| R | 40060 | 5.4% |
| s | 40060 | 5.4% |
| r | 40060 | 5.4% |
| C | 26388 | 3.5% |
| Other values (2) | 52776 | 7.1% |
is_canceled
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 34681 | |
| 1 | 31767 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 34681 | |
| 1 | 31767 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 34681 | |
| 1 | 31767 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 34681 | |
| 1 | 31767 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 34681 | |
| 1 | 31767 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 34681 | |
| 1 | 31767 |
lead_time
Real number (ℝ)
Zeros 
| Distinct | 454 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 104.29903 |
| Minimum | 0 |
|---|---|
| Maximum | 737 |
| Zeros | 3749 |
| Zeros (%) | 5.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 19 |
| median | 70 |
| Q3 | 160 |
| 95-th percentile | 316 |
| Maximum | 737 |
| Range | 737 |
| Interquartile range (IQR) | 141 |
Descriptive statistics
| Standard deviation | 107.47458 |
|---|---|
| Coefficient of variation (CV) | 1.0304466 |
| Kurtosis | 2.2741708 |
| Mean | 104.29903 |
| Median Absolute Deviation (MAD) | 60 |
| Skewness | 1.4451424 |
| Sum | 6930462 |
| Variance | 11550.785 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3749 | 5.6% |
| 1 | 1948 | 2.9% |
| 2 | 1138 | 1.7% |
| 3 | 967 | 1.5% |
| 4 | 855 | 1.3% |
| 5 | 760 | 1.1% |
| 7 | 717 | 1.1% |
| 6 | 711 | 1.1% |
| 12 | 578 | 0.9% |
| 10 | 570 | 0.9% |
| Other values (444) | 54455 |
| Value | Count | Frequency (%) |
| 0 | 3749 | |
| 1 | 1948 | |
| 2 | 1138 | 1.7% |
| 3 | 967 | 1.5% |
| 4 | 855 | 1.3% |
| 5 | 760 | 1.1% |
| 6 | 711 | 1.1% |
| 7 | 717 | 1.1% |
| 8 | 544 | 0.8% |
| 9 | 527 | 0.8% |
| Value | Count | Frequency (%) |
| 737 | 1 | < 0.1% |
| 709 | 1 | < 0.1% |
| 629 | 17 | |
| 626 | 30 | |
| 622 | 17 | |
| 615 | 17 | |
| 608 | 17 | |
| 605 | 30 | |
| 601 | 17 | |
| 594 | 17 |
arrival_date_year
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| 2016 | |
|---|---|
| 2017 | |
| 2015 |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2015 |
|---|---|
| 2nd row | 2015 |
| 3rd row | 2015 |
| 4th row | 2015 |
| 5th row | 2015 |
Common Values
| Value | Count | Frequency (%) |
| 2016 | 34210 | |
| 2017 | 17563 | |
| 2015 | 14675 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 2016 | 34210 | |
| 2017 | 17563 | |
| 2015 | 14675 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 66448 | |
| 0 | 66448 | |
| 1 | 66448 | |
| 6 | 34210 | |
| 7 | 17563 | 6.6% |
| 5 | 14675 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 265792 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 66448 | |
| 0 | 66448 | |
| 1 | 66448 | |
| 6 | 34210 | |
| 7 | 17563 | 6.6% |
| 5 | 14675 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 265792 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 66448 | |
| 0 | 66448 | |
| 1 | 66448 | |
| 6 | 34210 | |
| 7 | 17563 | 6.6% |
| 5 | 14675 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 265792 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 66448 | |
| 0 | 66448 | |
| 1 | 66448 | |
| 6 | 34210 | |
| 7 | 17563 | 6.6% |
| 5 | 14675 | 5.5% |
arrival_date_month
Categorical
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| August | |
|---|---|
| October | |
| September | |
| April | |
| July | |
| Other values (7) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 6.110974 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | July |
|---|---|
| 2nd row | July |
| 3rd row | July |
| 4th row | July |
| 5th row | July |
Common Values
| Value | Count | Frequency (%) |
| August | 7714 | |
| October | 6842 | |
| September | 6712 | |
| April | 6284 | |
| July | 6176 | |
| March | 5766 | |
| May | 5282 | |
| February | 4798 | |
| June | 4725 | |
| November | 4204 | |
| Other values (2) | 7945 |
Length
| Value | Count | Frequency (%) |
| august | 7714 | |
| october | 6842 | |
| september | 6712 | |
| april | 6284 | |
| july | 6176 | |
| march | 5766 | |
| may | 5282 | |
| february | 4798 | |
| june | 4725 | |
| november | 4204 | |
| Other values (2) | 7945 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 57344 | |
| r | 47349 | 11.7% |
| u | 34927 | 8.6% |
| b | 26701 | 6.6% |
| a | 23446 | 5.8% |
| t | 21268 | 5.2% |
| y | 20056 | 4.9% |
| c | 16753 | 4.1% |
| m | 15061 | 3.7% |
| J | 14701 | 3.6% |
| Other values (16) | 128456 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 406062 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 57344 | |
| r | 47349 | 11.7% |
| u | 34927 | 8.6% |
| b | 26701 | 6.6% |
| a | 23446 | 5.8% |
| t | 21268 | 5.2% |
| y | 20056 | 4.9% |
| c | 16753 | 4.1% |
| m | 15061 | 3.7% |
| J | 14701 | 3.6% |
| Other values (16) | 128456 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 406062 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 57344 | |
| r | 47349 | 11.7% |
| u | 34927 | 8.6% |
| b | 26701 | 6.6% |
| a | 23446 | 5.8% |
| t | 21268 | 5.2% |
| y | 20056 | 4.9% |
| c | 16753 | 4.1% |
| m | 15061 | 3.7% |
| J | 14701 | 3.6% |
| Other values (16) | 128456 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 406062 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 57344 | |
| r | 47349 | 11.7% |
| u | 34927 | 8.6% |
| b | 26701 | 6.6% |
| a | 23446 | 5.8% |
| t | 21268 | 5.2% |
| y | 20056 | 4.9% |
| c | 16753 | 4.1% |
| m | 15061 | 3.7% |
| J | 14701 | 3.6% |
| Other values (16) | 128456 |
arrival_date_week_number
Real number (ℝ)
High correlation 
| Distinct | 53 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.524696 |
| Minimum | 1 |
|---|---|
| Maximum | 53 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 15 |
| median | 29 |
| Q3 | 39 |
| 95-th percentile | 49 |
| Maximum | 53 |
| Range | 52 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 14.175895 |
|---|---|
| Coefficient of variation (CV) | 0.51502458 |
| Kurtosis | -1.1018194 |
| Mean | 27.524696 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.081017607 |
| Sum | 1828961 |
| Variance | 200.956 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 33 | 2023 | 3.0% |
| 15 | 1711 | 2.6% |
| 34 | 1703 | 2.6% |
| 41 | 1667 | 2.5% |
| 38 | 1663 | 2.5% |
| 32 | 1619 | 2.4% |
| 42 | 1617 | 2.4% |
| 37 | 1555 | 2.3% |
| 40 | 1513 | 2.3% |
| 43 | 1505 | 2.3% |
| Other values (43) | 49872 |
| Value | Count | Frequency (%) |
| 1 | 625 | |
| 2 | 852 | |
| 3 | 888 | |
| 4 | 978 | |
| 5 | 801 | |
| 6 | 950 | |
| 7 | 1368 | |
| 8 | 1157 | |
| 9 | 1251 | |
| 10 | 1224 |
| Value | Count | Frequency (%) |
| 53 | 1130 | |
| 52 | 800 | |
| 51 | 617 | |
| 50 | 760 | |
| 49 | 1099 | |
| 48 | 900 | |
| 47 | 1047 | |
| 46 | 930 | |
| 45 | 1281 | |
| 44 | 1372 |
arrival_date_day_of_month
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.670675 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 8 |
| median | 16 |
| Q3 | 23 |
| 95-th percentile | 30 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 8.8050254 |
|---|---|
| Coefficient of variation (CV) | 0.56187915 |
| Kurtosis | -1.1807551 |
| Mean | 15.670675 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.026508746 |
| Sum | 1041285 |
| Variance | 77.528472 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 2506 | 3.8% |
| 12 | 2474 | 3.7% |
| 16 | 2464 | 3.7% |
| 17 | 2372 | 3.6% |
| 18 | 2357 | 3.5% |
| 25 | 2342 | 3.5% |
| 2 | 2341 | 3.5% |
| 9 | 2340 | 3.5% |
| 15 | 2334 | 3.5% |
| 30 | 2306 | 3.5% |
| Other values (21) | 42612 |
| Value | Count | Frequency (%) |
| 1 | 2090 | |
| 2 | 2341 | |
| 3 | 2160 | |
| 4 | 2120 | |
| 5 | 2506 | |
| 6 | 2032 | |
| 7 | 2196 | |
| 8 | 2101 | |
| 9 | 2340 | |
| 10 | 1903 |
| Value | Count | Frequency (%) |
| 31 | 1302 | |
| 30 | 2306 | |
| 29 | 1891 | |
| 28 | 2101 | |
| 27 | 1924 | |
| 26 | 2306 | |
| 25 | 2342 | |
| 24 | 2133 | |
| 23 | 2034 | |
| 22 | 1964 |
stays_in_weekend_nights
Real number (ℝ)
Zeros 
| Distinct | 17 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.0312425 |
| Minimum | 0 |
|---|---|
| Maximum | 19 |
| Zeros | 27125 |
| Zeros (%) | 40.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 19 |
| Range | 19 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.083307 |
|---|---|
| Coefficient of variation (CV) | 1.0504871 |
| Kurtosis | 7.465002 |
| Mean | 1.0312425 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.4652904 |
| Sum | 68524 |
| Variance | 1.173554 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 27125 | |
| 2 | 20216 | |
| 1 | 16031 | |
| 4 | 1706 | 2.6% |
| 3 | 1066 | 1.6% |
| 6 | 143 | 0.2% |
| 5 | 61 | 0.1% |
| 8 | 54 | 0.1% |
| 7 | 19 | < 0.1% |
| 9 | 8 | < 0.1% |
| Other values (7) | 19 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 27125 | |
| 1 | 16031 | |
| 2 | 20216 | |
| 3 | 1066 | 1.6% |
| 4 | 1706 | 2.6% |
| 5 | 61 | 0.1% |
| 6 | 143 | 0.2% |
| 7 | 19 | < 0.1% |
| 8 | 54 | 0.1% |
| 9 | 8 | < 0.1% |
| Value | Count | Frequency (%) |
| 19 | 1 | < 0.1% |
| 18 | 1 | < 0.1% |
| 16 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 2 | < 0.1% |
| 12 | 5 | < 0.1% |
| 10 | 7 | < 0.1% |
| 9 | 8 | < 0.1% |
| 8 | 54 | |
| 7 | 19 | < 0.1% |
stays_in_week_nights
Real number (ℝ)
Zeros 
| Distinct | 33 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.8015892 |
| Minimum | 0 |
|---|---|
| Maximum | 50 |
| Zeros | 3977 |
| Zeros (%) | 6.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 4 |
| 95-th percentile | 6 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.1883284 |
|---|---|
| Coefficient of variation (CV) | 0.78110253 |
| Kurtosis | 19.55695 |
| Mean | 2.8015892 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 2.6643508 |
| Sum | 186160 |
| Variance | 4.7887813 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 17216 | |
| 1 | 15092 | |
| 3 | 11575 | |
| 5 | 8853 | |
| 4 | 5469 | 8.2% |
| 0 | 3977 | 6.0% |
| 6 | 1273 | 1.9% |
| 10 | 966 | 1.5% |
| 7 | 918 | 1.4% |
| 8 | 551 | 0.8% |
| Other values (23) | 558 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 3977 | 6.0% |
| 1 | 15092 | |
| 2 | 17216 | |
| 3 | 11575 | |
| 4 | 5469 | 8.2% |
| 5 | 8853 | |
| 6 | 1273 | 1.9% |
| 7 | 918 | 1.4% |
| 8 | 551 | 0.8% |
| 9 | 195 | 0.3% |
| Value | Count | Frequency (%) |
| 50 | 1 | < 0.1% |
| 42 | 1 | < 0.1% |
| 40 | 2 | < 0.1% |
| 34 | 1 | < 0.1% |
| 33 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 30 | 4 | |
| 26 | 1 | < 0.1% |
| 25 | 5 | |
| 24 | 1 | < 0.1% |
adults
Real number (ℝ)
Skewed 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.8648116 |
| Minimum | 0 |
|---|---|
| Maximum | 55 |
| Zeros | 142 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 2 |
| Maximum | 55 |
| Range | 55 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.61908148 |
|---|---|
| Coefficient of variation (CV) | 0.33198072 |
| Kurtosis | 1860.2495 |
| Mean | 1.8648116 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 27.117282 |
| Sum | 123913 |
| Variance | 0.38326188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 51718 | |
| 1 | 11835 | 17.8% |
| 3 | 2701 | 4.1% |
| 0 | 142 | 0.2% |
| 4 | 36 | 0.1% |
| 26 | 5 | < 0.1% |
| 27 | 2 | < 0.1% |
| 20 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| 40 | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 142 | 0.2% |
| 1 | 11835 | 17.8% |
| 2 | 51718 | |
| 3 | 2701 | 4.1% |
| 4 | 36 | 0.1% |
| 5 | 2 | < 0.1% |
| 6 | 1 | < 0.1% |
| 10 | 1 | < 0.1% |
| 20 | 2 | < 0.1% |
| 26 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 55 | 1 | < 0.1% |
| 50 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 27 | 2 | < 0.1% |
| 26 | 5 | < 0.1% |
| 20 | 2 | < 0.1% |
| 10 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 36 |
children
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4 |
| Missing (%) | < 0.1% |
| Memory size | 519.2 KiB |
| 0.0 | |
|---|---|
| 1.0 | 2637 |
| 2.0 | 2330 |
| 3.0 | 30 |
| 10.0 | 1 |
Length
| Max length | 4 |
|---|---|
| Median length | 3 |
| Mean length | 3.0000151 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 61446 | |
| 1.0 | 2637 | 4.0% |
| 2.0 | 2330 | 3.5% |
| 3.0 | 30 | < 0.1% |
| 10.0 | 1 | < 0.1% |
| (Missing) | 4 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 61446 | |
| 1.0 | 2637 | 4.0% |
| 2.0 | 2330 | 3.5% |
| 3.0 | 30 | < 0.1% |
| 10.0 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 127891 | |
| . | 66444 | |
| 1 | 2638 | 1.3% |
| 2 | 2330 | 1.2% |
| 3 | 30 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 199333 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 127891 | |
| . | 66444 | |
| 1 | 2638 | 1.3% |
| 2 | 2330 | 1.2% |
| 3 | 30 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 199333 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 127891 | |
| . | 66444 | |
| 1 | 2638 | 1.3% |
| 2 | 2330 | 1.2% |
| 3 | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 199333 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 127891 | |
| . | 66444 | |
| 1 | 2638 | 1.3% |
| 2 | 2330 | 1.2% |
| 3 | 30 | < 0.1% |
babies
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| 0 | |
|---|---|
| 1 | 614 |
| 2 | 9 |
| 10 | 1 |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.000015 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 65824 | |
| 1 | 614 | 0.9% |
| 2 | 9 | < 0.1% |
| 10 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 65824 | |
| 1 | 614 | 0.9% |
| 2 | 9 | < 0.1% |
| 10 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 65825 | |
| 1 | 615 | 0.9% |
| 2 | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66449 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 65825 | |
| 1 | 615 | 0.9% |
| 2 | 9 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66449 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 65825 | |
| 1 | 615 | 0.9% |
| 2 | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66449 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 65825 | |
| 1 | 615 | 0.9% |
| 2 | 9 | < 0.1% |
meal
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| BB | |
|---|---|
| HB | |
| SC | 3025 |
| Undefined | 1169 |
| FB | 790 |
Length
| Max length | 9 |
|---|---|
| Median length | 2 |
| Mean length | 2.1231489 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BB |
|---|---|
| 2nd row | BB |
| 3rd row | BB |
| 4th row | BB |
| 5th row | BB |
Common Values
| Value | Count | Frequency (%) |
| BB | 51129 | |
| HB | 10335 | 15.6% |
| SC | 3025 | 4.6% |
| Undefined | 1169 | 1.8% |
| FB | 790 | 1.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| bb | 51129 | |
| hb | 10335 | 15.6% |
| sc | 3025 | 4.6% |
| undefined | 1169 | 1.8% |
| fb | 790 | 1.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| B | 113383 | |
| H | 10335 | 7.3% |
| S | 3025 | 2.1% |
| C | 3025 | 2.1% |
| n | 2338 | 1.7% |
| d | 2338 | 1.7% |
| e | 2338 | 1.7% |
| U | 1169 | 0.8% |
| f | 1169 | 0.8% |
| i | 1169 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 141079 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| B | 113383 | |
| H | 10335 | 7.3% |
| S | 3025 | 2.1% |
| C | 3025 | 2.1% |
| n | 2338 | 1.7% |
| d | 2338 | 1.7% |
| e | 2338 | 1.7% |
| U | 1169 | 0.8% |
| f | 1169 | 0.8% |
| i | 1169 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 141079 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| B | 113383 | |
| H | 10335 | 7.3% |
| S | 3025 | 2.1% |
| C | 3025 | 2.1% |
| n | 2338 | 1.7% |
| d | 2338 | 1.7% |
| e | 2338 | 1.7% |
| U | 1169 | 0.8% |
| f | 1169 | 0.8% |
| i | 1169 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 141079 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| B | 113383 | |
| H | 10335 | 7.3% |
| S | 3025 | 2.1% |
| C | 3025 | 2.1% |
| n | 2338 | 1.7% |
| d | 2338 | 1.7% |
| e | 2338 | 1.7% |
| U | 1169 | 0.8% |
| f | 1169 | 0.8% |
| i | 1169 | 0.8% |
country
Text
| Distinct | 147 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 486 |
| Missing (%) | 0.7% |
| Memory size | 519.2 KiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.987326 |
| Min length | 2 |
Unique
| Unique | 28 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | PRT |
|---|---|
| 2nd row | PRT |
| 3rd row | GBR |
| 4th row | GBR |
| 5th row | GBR |
| Value | Count | Frequency (%) |
| prt | 31276 | |
| gbr | 7980 | 12.1% |
| esp | 5681 | 8.6% |
| fra | 3514 | 5.3% |
| irl | 2464 | 3.7% |
| deu | 2370 | 3.6% |
| ita | 1689 | 2.6% |
| bra | 1019 | 1.5% |
| nld | 844 | 1.3% |
| cn | 836 | 1.3% |
| Other values (137) | 8289 | 12.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 48057 | |
| P | 37614 | |
| T | 33794 | |
| E | 10338 | 5.2% |
| B | 9932 | 5.0% |
| G | 8512 | 4.3% |
| A | 8423 | 4.3% |
| S | 7907 | 4.0% |
| L | 5073 | 2.6% |
| U | 5021 | 2.5% |
| Other values (16) | 22379 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 197050 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| R | 48057 | |
| P | 37614 | |
| T | 33794 | |
| E | 10338 | 5.2% |
| B | 9932 | 5.0% |
| G | 8512 | 4.3% |
| A | 8423 | 4.3% |
| S | 7907 | 4.0% |
| L | 5073 | 2.6% |
| U | 5021 | 2.5% |
| Other values (16) | 22379 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 197050 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| R | 48057 | |
| P | 37614 | |
| T | 33794 | |
| E | 10338 | 5.2% |
| B | 9932 | 5.0% |
| G | 8512 | 4.3% |
| A | 8423 | 4.3% |
| S | 7907 | 4.0% |
| L | 5073 | 2.6% |
| U | 5021 | 2.5% |
| Other values (16) | 22379 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 197050 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| R | 48057 | |
| P | 37614 | |
| T | 33794 | |
| E | 10338 | 5.2% |
| B | 9932 | 5.0% |
| G | 8512 | 4.3% |
| A | 8423 | 4.3% |
| S | 7907 | 4.0% |
| L | 5073 | 2.6% |
| U | 5021 | 2.5% |
| Other values (16) | 22379 |
market_segment
Categorical
High correlation 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| Online TA | |
|---|---|
| Offline TA/TO | |
| Groups | |
| Direct | |
| Corporate | 2804 |
| Other values (3) | 305 |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 8.919802 |
| Min length | 6 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Direct |
| 3rd row | Direct |
| 4th row | Corporate |
| 5th row | Online TA |
Common Values
| Value | Count | Frequency (%) |
| Online TA | 29901 | |
| Offline TA/TO | 13419 | |
| Groups | 12381 | |
| Direct | 7638 | 11.5% |
| Corporate | 2804 | 4.2% |
| Complementary | 271 | 0.4% |
| Aviation | 32 | < 0.1% |
| Undefined | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| online | 29901 | |
| ta | 29901 | |
| offline | 13419 | |
| ta/to | 13419 | |
| groups | 12381 | |
| direct | 7638 | 7.0% |
| corporate | 2804 | 2.6% |
| complementary | 271 | 0.2% |
| aviation | 32 | < 0.1% |
| undefined | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 73528 | |
| O | 56739 | |
| T | 56739 | |
| e | 54308 | |
| i | 51024 | |
| l | 43591 | 7.4% |
| A | 43352 | 7.3% |
| 43320 | 7.3% | |
| f | 26840 | 4.5% |
| r | 25898 | 4.4% |
| Other values (16) | 117364 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 592703 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 73528 | |
| O | 56739 | |
| T | 56739 | |
| e | 54308 | |
| i | 51024 | |
| l | 43591 | 7.4% |
| A | 43352 | 7.3% |
| 43320 | 7.3% | |
| f | 26840 | 4.5% |
| r | 25898 | 4.4% |
| Other values (16) | 117364 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 592703 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 73528 | |
| O | 56739 | |
| T | 56739 | |
| e | 54308 | |
| i | 51024 | |
| l | 43591 | 7.4% |
| A | 43352 | 7.3% |
| 43320 | 7.3% | |
| f | 26840 | 4.5% |
| r | 25898 | 4.4% |
| Other values (16) | 117364 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 592703 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 73528 | |
| O | 56739 | |
| T | 56739 | |
| e | 54308 | |
| i | 51024 | |
| l | 43591 | 7.4% |
| A | 43352 | 7.3% |
| 43320 | 7.3% | |
| f | 26840 | 4.5% |
| r | 25898 | 4.4% |
| Other values (16) | 117364 |
distribution_channel
Categorical
High correlation  Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| TA/TO | |
|---|---|
| Direct | |
| Corporate | 3882 |
| GDS | 30 |
| Undefined | 5 |
Length
| Max length | 9 |
|---|---|
| Median length | 5 |
| Mean length | 5.3707561 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Direct |
|---|---|
| 2nd row | Direct |
| 3rd row | Direct |
| 4th row | Corporate |
| 5th row | TA/TO |
Common Values
| Value | Count | Frequency (%) |
| TA/TO | 53383 | |
| Direct | 9148 | 13.8% |
| Corporate | 3882 | 5.8% |
| GDS | 30 | < 0.1% |
| Undefined | 5 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ta/to | 53383 | |
| direct | 9148 | 13.8% |
| corporate | 3882 | 5.8% |
| gds | 30 | < 0.1% |
| undefined | 5 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 106766 | |
| / | 53383 | |
| O | 53383 | |
| A | 53383 | |
| r | 16912 | 4.7% |
| e | 13040 | 3.7% |
| t | 13030 | 3.7% |
| D | 9178 | 2.6% |
| i | 9153 | 2.6% |
| c | 9148 | 2.6% |
| Other values (10) | 19500 | 5.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 356876 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| T | 106766 | |
| / | 53383 | |
| O | 53383 | |
| A | 53383 | |
| r | 16912 | 4.7% |
| e | 13040 | 3.7% |
| t | 13030 | 3.7% |
| D | 9178 | 2.6% |
| i | 9153 | 2.6% |
| c | 9148 | 2.6% |
| Other values (10) | 19500 | 5.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 356876 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| T | 106766 | |
| / | 53383 | |
| O | 53383 | |
| A | 53383 | |
| r | 16912 | 4.7% |
| e | 13040 | 3.7% |
| t | 13030 | 3.7% |
| D | 9178 | 2.6% |
| i | 9153 | 2.6% |
| c | 9148 | 2.6% |
| Other values (10) | 19500 | 5.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 356876 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| T | 106766 | |
| / | 53383 | |
| O | 53383 | |
| A | 53383 | |
| r | 16912 | 4.7% |
| e | 13040 | 3.7% |
| t | 13030 | 3.7% |
| D | 9178 | 2.6% |
| i | 9153 | 2.6% |
| c | 9148 | 2.6% |
| Other values (10) | 19500 | 5.5% |
is_repeated_guest
Categorical
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| 0 | |
|---|---|
| 1 | 1778 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 64670 | |
| 1 | 1778 | 2.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 64670 | |
| 1 | 1778 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 64670 | |
| 1 | 1778 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 64670 | |
| 1 | 1778 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 64670 | |
| 1 | 1778 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 64670 | |
| 1 | 1778 | 2.7% |
previous_cancellations
Real number (ℝ)
Skewed  Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06132615 |
| Minimum | 0 |
|---|---|
| Maximum | 26 |
| Zeros | 65353 |
| Zeros (%) | 98.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 26 |
| Range | 26 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.0378418 |
|---|---|
| Coefficient of variation (CV) | 16.923315 |
| Kurtosis | 518.27002 |
| Mean | 0.06132615 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 22.471112 |
| Sum | 4075 |
| Variance | 1.0771155 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 65353 | |
| 1 | 896 | 1.3% |
| 24 | 48 | 0.1% |
| 2 | 44 | 0.1% |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 19 | 19 | < 0.1% |
| 3 | 14 | < 0.1% |
| 14 | 14 | < 0.1% |
| 4 | 6 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 65353 | |
| 1 | 896 | 1.3% |
| 2 | 44 | 0.1% |
| 3 | 14 | < 0.1% |
| 4 | 6 | < 0.1% |
| 5 | 3 | < 0.1% |
| 14 | 14 | < 0.1% |
| 19 | 19 | < 0.1% |
| 24 | 48 | 0.1% |
| 25 | 25 | < 0.1% |
| Value | Count | Frequency (%) |
| 26 | 26 | < 0.1% |
| 25 | 25 | < 0.1% |
| 24 | 48 | 0.1% |
| 19 | 19 | < 0.1% |
| 14 | 14 | < 0.1% |
| 5 | 3 | < 0.1% |
| 4 | 6 | < 0.1% |
| 3 | 14 | < 0.1% |
| 2 | 44 | 0.1% |
| 1 | 896 |
previous_bookings_not_canceled
Real number (ℝ)
Zeros 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.088294606 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 64416 |
| Zeros (%) | 96.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.7812591 |
|---|---|
| Coefficient of variation (CV) | 8.848322 |
| Kurtosis | 405.48154 |
| Mean | 0.088294606 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 17.059997 |
| Sum | 5867 |
| Variance | 0.61036579 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 64416 | |
| 1 | 973 | 1.5% |
| 2 | 388 | 0.6% |
| 3 | 204 | 0.3% |
| 4 | 127 | 0.2% |
| 5 | 91 | 0.1% |
| 6 | 56 | 0.1% |
| 7 | 37 | 0.1% |
| 8 | 33 | < 0.1% |
| 9 | 24 | < 0.1% |
| Other values (21) | 99 | 0.1% |
| Value | Count | Frequency (%) |
| 0 | 64416 | |
| 1 | 973 | 1.5% |
| 2 | 388 | 0.6% |
| 3 | 204 | 0.3% |
| 4 | 127 | 0.2% |
| 5 | 91 | 0.1% |
| 6 | 56 | 0.1% |
| 7 | 37 | 0.1% |
| 8 | 33 | < 0.1% |
| 9 | 24 | < 0.1% |
| Value | Count | Frequency (%) |
| 30 | 1 | < 0.1% |
| 29 | 1 | < 0.1% |
| 28 | 1 | < 0.1% |
| 27 | 2 | |
| 26 | 1 | < 0.1% |
| 25 | 3 | |
| 24 | 2 | |
| 23 | 2 | |
| 22 | 2 | |
| 21 | 2 |
reserved_room_type
Categorical
High correlation  Imbalance 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| A | |
|---|---|
| D | |
| E | |
| F | 1701 |
| G | 1678 |
| Other values (5) | 2008 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | A |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 45190 | |
| D | 10588 | 15.9% |
| E | 5283 | 8.0% |
| F | 1701 | 2.6% |
| G | 1678 | 2.5% |
| C | 922 | 1.4% |
| H | 601 | 0.9% |
| B | 469 | 0.7% |
| P | 10 | < 0.1% |
| L | 6 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| a | 45190 | |
| d | 10588 | 15.9% |
| e | 5283 | 8.0% |
| f | 1701 | 2.6% |
| g | 1678 | 2.5% |
| c | 922 | 1.4% |
| h | 601 | 0.9% |
| b | 469 | 0.7% |
| p | 10 | < 0.1% |
| l | 6 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 45190 | |
| D | 10588 | 15.9% |
| E | 5283 | 8.0% |
| F | 1701 | 2.6% |
| G | 1678 | 2.5% |
| C | 922 | 1.4% |
| H | 601 | 0.9% |
| B | 469 | 0.7% |
| P | 10 | < 0.1% |
| L | 6 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 45190 | |
| D | 10588 | 15.9% |
| E | 5283 | 8.0% |
| F | 1701 | 2.6% |
| G | 1678 | 2.5% |
| C | 922 | 1.4% |
| H | 601 | 0.9% |
| B | 469 | 0.7% |
| P | 10 | < 0.1% |
| L | 6 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 45190 | |
| D | 10588 | 15.9% |
| E | 5283 | 8.0% |
| F | 1701 | 2.6% |
| G | 1678 | 2.5% |
| C | 922 | 1.4% |
| H | 601 | 0.9% |
| B | 469 | 0.7% |
| P | 10 | < 0.1% |
| L | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 45190 | |
| D | 10588 | 15.9% |
| E | 5283 | 8.0% |
| F | 1701 | 2.6% |
| G | 1678 | 2.5% |
| C | 922 | 1.4% |
| H | 601 | 0.9% |
| B | 469 | 0.7% |
| P | 10 | < 0.1% |
| L | 6 | < 0.1% |
assigned_room_type
Categorical
High correlation 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| A | |
|---|---|
| D | |
| E | |
| F | 2362 |
| C | 2229 |
| Other values (7) |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | C |
|---|---|
| 2nd row | C |
| 3rd row | C |
| 4th row | A |
| 5th row | A |
Common Values
| Value | Count | Frequency (%) |
| A | 37359 | |
| D | 14447 | 21.7% |
| E | 6076 | 9.1% |
| F | 2362 | 3.6% |
| C | 2229 | 3.4% |
| G | 1951 | 2.9% |
| B | 912 | 1.4% |
| H | 712 | 1.1% |
| I | 363 | 0.5% |
| K | 26 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Length
| Value | Count | Frequency (%) |
| a | 37359 | |
| d | 14447 | 21.7% |
| e | 6076 | 9.1% |
| f | 2362 | 3.6% |
| c | 2229 | 3.4% |
| g | 1951 | 2.9% |
| b | 912 | 1.4% |
| h | 712 | 1.1% |
| i | 363 | 0.5% |
| k | 26 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 37359 | |
| D | 14447 | 21.7% |
| E | 6076 | 9.1% |
| F | 2362 | 3.6% |
| C | 2229 | 3.4% |
| G | 1951 | 2.9% |
| B | 912 | 1.4% |
| H | 712 | 1.1% |
| I | 363 | 0.5% |
| K | 26 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 37359 | |
| D | 14447 | 21.7% |
| E | 6076 | 9.1% |
| F | 2362 | 3.6% |
| C | 2229 | 3.4% |
| G | 1951 | 2.9% |
| B | 912 | 1.4% |
| H | 712 | 1.1% |
| I | 363 | 0.5% |
| K | 26 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 37359 | |
| D | 14447 | 21.7% |
| E | 6076 | 9.1% |
| F | 2362 | 3.6% |
| C | 2229 | 3.4% |
| G | 1951 | 2.9% |
| B | 912 | 1.4% |
| H | 712 | 1.1% |
| I | 363 | 0.5% |
| K | 26 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 37359 | |
| D | 14447 | 21.7% |
| E | 6076 | 9.1% |
| F | 2362 | 3.6% |
| C | 2229 | 3.4% |
| G | 1951 | 2.9% |
| B | 912 | 1.4% |
| H | 712 | 1.1% |
| I | 363 | 0.5% |
| K | 26 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
booking_changes
Real number (ℝ)
Zeros 
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22599627 |
| Minimum | 0 |
|---|---|
| Maximum | 20 |
| Zeros | 56399 |
| Zeros (%) | 84.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.67243446 |
|---|---|
| Coefficient of variation (CV) | 2.9754229 |
| Kurtosis | 71.205598 |
| Mean | 0.22599627 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.8559966 |
| Sum | 15017 |
| Variance | 0.4521681 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 56399 | |
| 1 | 6970 | 10.5% |
| 2 | 2080 | 3.1% |
| 3 | 580 | 0.9% |
| 4 | 236 | 0.4% |
| 5 | 84 | 0.1% |
| 6 | 44 | 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 7 | < 0.1% |
| Other values (8) | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 56399 | |
| 1 | 6970 | 10.5% |
| 2 | 2080 | 3.1% |
| 3 | 580 | 0.9% |
| 4 | 236 | 0.4% |
| 5 | 84 | 0.1% |
| 6 | 44 | 0.1% |
| 7 | 21 | < 0.1% |
| 8 | 11 | < 0.1% |
| 9 | 7 | < 0.1% |
| Value | Count | Frequency (%) |
| 20 | 1 | < 0.1% |
| 17 | 2 | < 0.1% |
| 16 | 1 | < 0.1% |
| 15 | 2 | < 0.1% |
| 14 | 1 | < 0.1% |
| 13 | 5 | |
| 12 | 1 | < 0.1% |
| 10 | 3 | < 0.1% |
| 9 | 7 | |
| 8 | 11 |
deposit_type
Categorical
Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| No Deposit | |
|---|---|
| Non Refund | |
| Refundable | 145 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | No Deposit |
|---|---|
| 2nd row | No Deposit |
| 3rd row | No Deposit |
| 4th row | No Deposit |
| 5th row | No Deposit |
Common Values
| Value | Count | Frequency (%) |
| No Deposit | 57290 | |
| Non Refund | 9013 | 13.6% |
| Refundable | 145 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 57290 | |
| deposit | 57290 | |
| non | 9013 | 6.8% |
| refund | 9013 | 6.8% |
| refundable | 145 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 123593 | |
| e | 66593 | |
| N | 66303 | |
| 66303 | ||
| s | 57290 | |
| i | 57290 | |
| t | 57290 | |
| p | 57290 | |
| D | 57290 | |
| n | 18171 | 2.7% |
| Other values (7) | 37067 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 664480 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| o | 123593 | |
| e | 66593 | |
| N | 66303 | |
| 66303 | ||
| s | 57290 | |
| i | 57290 | |
| t | 57290 | |
| p | 57290 | |
| D | 57290 | |
| n | 18171 | 2.7% |
| Other values (7) | 37067 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 664480 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| o | 123593 | |
| e | 66593 | |
| N | 66303 | |
| 66303 | ||
| s | 57290 | |
| i | 57290 | |
| t | 57290 | |
| p | 57290 | |
| D | 57290 | |
| n | 18171 | 2.7% |
| Other values (7) | 37067 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 664480 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| o | 123593 | |
| e | 66593 | |
| N | 66303 | |
| 66303 | ||
| s | 57290 | |
| i | 57290 | |
| t | 57290 | |
| p | 57290 | |
| D | 57290 | |
| n | 18171 | 2.7% |
| Other values (7) | 37067 | 5.6% |
agent
Real number (ℝ)
High correlation  Missing 
| Distinct | 261 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 10124 |
| Missing (%) | 15.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 133.76218 |
| Minimum | 1 |
|---|---|
| Maximum | 535 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 9 |
| median | 134 |
| Q3 | 240 |
| 95-th percentile | 313 |
| Maximum | 535 |
| Range | 534 |
| Interquartile range (IQR) | 231 |
Descriptive statistics
| Standard deviation | 121.08427 |
|---|---|
| Coefficient of variation (CV) | 0.9052205 |
| Kurtosis | -1.1089051 |
| Mean | 133.76218 |
| Median Absolute Deviation (MAD) | 114 |
| Skewness | 0.29797347 |
| Sum | 7534021 |
| Variance | 14661.4 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 240 | 13906 | |
| 9 | 10785 | |
| 1 | 3819 | 5.7% |
| 250 | 2869 | 4.3% |
| 241 | 1721 | 2.6% |
| 6 | 1408 | 2.1% |
| 40 | 1013 | 1.5% |
| 314 | 927 | 1.4% |
| 242 | 779 | 1.2% |
| 37 | 687 | 1.0% |
| Other values (251) | 18410 | |
| (Missing) | 10124 |
| Value | Count | Frequency (%) |
| 1 | 3819 | 5.7% |
| 2 | 125 | 0.2% |
| 3 | 653 | 1.0% |
| 5 | 256 | 0.4% |
| 6 | 1408 | 2.1% |
| 7 | 633 | 1.0% |
| 8 | 661 | 1.0% |
| 9 | 10785 | |
| 10 | 52 | 0.1% |
| 11 | 225 | 0.3% |
| Value | Count | Frequency (%) |
| 535 | 3 | < 0.1% |
| 531 | 68 | |
| 527 | 35 | |
| 526 | 10 | < 0.1% |
| 510 | 2 | < 0.1% |
| 508 | 6 | < 0.1% |
| 502 | 24 | < 0.1% |
| 497 | 1 | < 0.1% |
| 495 | 50 | |
| 493 | 35 |
company
Real number (ℝ)
High correlation  Missing 
| Distinct | 273 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 62703 |
| Missing (%) | 94.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 223.18665 |
| Minimum | 6 |
|---|---|
| Maximum | 543 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 6 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 110 |
| median | 223 |
| Q3 | 307 |
| 95-th percentile | 484 |
| Maximum | 543 |
| Range | 537 |
| Interquartile range (IQR) | 197 |
Descriptive statistics
| Standard deviation | 130.33971 |
|---|---|
| Coefficient of variation (CV) | 0.58399419 |
| Kurtosis | -0.51481426 |
| Mean | 223.18665 |
| Median Absolute Deviation (MAD) | 101 |
| Skewness | 0.37818308 |
| Sum | 835834 |
| Variance | 16988.439 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 223 | 784 | 1.2% |
| 281 | 138 | 0.2% |
| 154 | 133 | 0.2% |
| 67 | 107 | 0.2% |
| 405 | 101 | 0.2% |
| 94 | 87 | 0.1% |
| 47 | 67 | 0.1% |
| 135 | 64 | 0.1% |
| 331 | 59 | 0.1% |
| 498 | 58 | 0.1% |
| Other values (263) | 2147 | 3.2% |
| (Missing) | 62703 |
| Value | Count | Frequency (%) |
| 6 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 9 | 36 | |
| 10 | 1 | < 0.1% |
| 12 | 14 | < 0.1% |
| 14 | 3 | < 0.1% |
| 16 | 5 | < 0.1% |
| 20 | 50 | |
| 22 | 6 | < 0.1% |
| 28 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 543 | 2 | < 0.1% |
| 541 | 1 | < 0.1% |
| 539 | 2 | < 0.1% |
| 534 | 2 | < 0.1% |
| 531 | 1 | < 0.1% |
| 530 | 5 | < 0.1% |
| 528 | 2 | < 0.1% |
| 525 | 15 | |
| 523 | 19 | |
| 521 | 7 | < 0.1% |
days_in_waiting_list
Real number (ℝ)
Zeros 
| Distinct | 110 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3267818 |
| Minimum | 0 |
|---|---|
| Maximum | 391 |
| Zeros | 63984 |
| Zeros (%) | 96.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 391 |
| Range | 391 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 22.223456 |
|---|---|
| Coefficient of variation (CV) | 6.6801664 |
| Kurtosis | 125.59574 |
| Mean | 3.3267818 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.9470935 |
| Sum | 221058 |
| Variance | 493.88202 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 63984 | |
| 39 | 186 | 0.3% |
| 58 | 164 | 0.2% |
| 31 | 102 | 0.2% |
| 69 | 89 | 0.1% |
| 63 | 80 | 0.1% |
| 87 | 80 | 0.1% |
| 111 | 71 | 0.1% |
| 101 | 65 | 0.1% |
| 77 | 63 | 0.1% |
| Other values (100) | 1564 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 63984 | |
| 1 | 7 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 59 | 0.1% |
| 4 | 11 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 4 | < 0.1% |
| 7 | 1 | < 0.1% |
| 8 | 7 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 391 | 45 | |
| 379 | 15 | < 0.1% |
| 330 | 15 | < 0.1% |
| 259 | 10 | < 0.1% |
| 236 | 35 | |
| 224 | 10 | < 0.1% |
| 223 | 60 | |
| 215 | 21 | < 0.1% |
| 207 | 15 | < 0.1% |
| 193 | 1 | < 0.1% |
customer_type
Categorical
Imbalance 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| Transient | |
|---|---|
| Transient-Party | |
| Contract | 2521 |
| Group | 318 |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 10.177176 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Transient |
|---|---|
| 2nd row | Transient |
| 3rd row | Transient |
| 4th row | Transient |
| 5th row | Transient |
Common Values
| Value | Count | Frequency (%) |
| Transient | 49940 | |
| Transient-Party | 13669 | 20.6% |
| Contract | 2521 | 3.8% |
| Group | 318 | 0.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| transient | 49940 | |
| transient-party | 13669 | 20.6% |
| contract | 2521 | 3.8% |
| group | 318 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 129739 | |
| t | 82320 | |
| r | 80117 | |
| a | 79799 | |
| T | 63609 | |
| s | 63609 | |
| i | 63609 | |
| e | 63609 | |
| y | 13669 | 2.0% |
| - | 13669 | 2.0% |
| Other values (7) | 22504 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 676253 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| n | 129739 | |
| t | 82320 | |
| r | 80117 | |
| a | 79799 | |
| T | 63609 | |
| s | 63609 | |
| i | 63609 | |
| e | 63609 | |
| y | 13669 | 2.0% |
| - | 13669 | 2.0% |
| Other values (7) | 22504 | 3.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 676253 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| n | 129739 | |
| t | 82320 | |
| r | 80117 | |
| a | 79799 | |
| T | 63609 | |
| s | 63609 | |
| i | 63609 | |
| e | 63609 | |
| y | 13669 | 2.0% |
| - | 13669 | 2.0% |
| Other values (7) | 22504 | 3.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 676253 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| n | 129739 | |
| t | 82320 | |
| r | 80117 | |
| a | 79799 | |
| T | 63609 | |
| s | 63609 | |
| i | 63609 | |
| e | 63609 | |
| y | 13669 | 2.0% |
| - | 13669 | 2.0% |
| Other values (7) | 22504 | 3.3% |
adr
Real number (ℝ)
Zeros 
| Distinct | 7051 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 96.321678 |
| Minimum | -6.38 |
|---|---|
| Maximum | 5400 |
| Zeros | 975 |
| Zeros (%) | 1.5% |
| Negative | 1 |
| Negative (%) | < 0.1% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | -6.38 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 62 |
| median | 85 |
| Q3 | 120 |
| 95-th percentile | 202.8625 |
| Maximum | 5400 |
| Range | 5406.38 |
| Interquartile range (IQR) | 58 |
Descriptive statistics
| Standard deviation | 56.126409 |
|---|---|
| Coefficient of variation (CV) | 0.58269759 |
| Kurtosis | 1201.0133 |
| Mean | 96.321678 |
| Median Absolute Deviation (MAD) | 26.6 |
| Skewness | 13.715711 |
| Sum | 6400382.9 |
| Variance | 3150.1738 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 62 | 2042 | 3.1% |
| 75 | 1377 | 2.1% |
| 65 | 1124 | 1.7% |
| 48 | 1036 | 1.6% |
| 80 | 1032 | 1.6% |
| 0 | 975 | 1.5% |
| 90 | 950 | 1.4% |
| 60 | 899 | 1.4% |
| 100 | 880 | 1.3% |
| 85 | 853 | 1.3% |
| Other values (7041) | 55280 |
| Value | Count | Frequency (%) |
| -6.38 | 1 | < 0.1% |
| 0 | 975 | |
| 0.26 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 1 | 3 | < 0.1% |
| 1.48 | 1 | < 0.1% |
| 1.56 | 2 | < 0.1% |
| 1.8 | 1 | < 0.1% |
| 2 | 8 | < 0.1% |
| 2.4 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 5400 | 1 | |
| 508 | 1 | |
| 450 | 1 | |
| 437 | 1 | |
| 426.25 | 1 | |
| 402 | 1 | |
| 397.38 | 1 | |
| 392 | 2 | |
| 388 | 2 | |
| 387 | 1 |
required_car_parking_spaces
Categorical
Imbalance 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| 0 | |
|---|---|
| 1 | 5625 |
| 2 | 25 |
| 8 | 2 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 60795 | |
| 1 | 5625 | 8.5% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 60795 | |
| 1 | 5625 | 8.5% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 60795 | |
| 1 | 5625 | 8.5% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 60795 | |
| 1 | 5625 | 8.5% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 60795 | |
| 1 | 5625 | 8.5% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 66448 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 60795 | |
| 1 | 5625 | 8.5% |
| 2 | 25 | < 0.1% |
| 8 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
total_of_special_requests
Real number (ℝ)
Zeros 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.49870575 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 42616 |
| Zeros (%) | 64.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 519.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 2 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.76302471 |
|---|---|
| Coefficient of variation (CV) | 1.5300098 |
| Kurtosis | 1.9648939 |
| Mean | 0.49870575 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.5218721 |
| Sum | 33138 |
| Variance | 0.5822067 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 42616 | |
| 1 | 16055 | 24.2% |
| 2 | 6436 | 9.7% |
| 3 | 1167 | 1.8% |
| 4 | 160 | 0.2% |
| 5 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 42616 | |
| 1 | 16055 | 24.2% |
| 2 | 6436 | 9.7% |
| 3 | 1167 | 1.8% |
| 4 | 160 | 0.2% |
| 5 | 14 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 14 | < 0.1% |
| 4 | 160 | 0.2% |
| 3 | 1167 | 1.8% |
| 2 | 6436 | 9.7% |
| 1 | 16055 | 24.2% |
| 0 | 42616 |
reservation_status
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 519.2 KiB |
| Check-Out | |
|---|---|
| Canceled | |
| No-Show | 992 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.506998 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Check-Out |
|---|---|
| 2nd row | Check-Out |
| 3rd row | Check-Out |
| 4th row | Check-Out |
| 5th row | Check-Out |
Common Values
| Value | Count | Frequency (%) |
| Check-Out | 34681 | |
| Canceled | 30775 | |
| No-Show | 992 | 1.5% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| check-out | 34681 | |
| canceled | 30775 | |
| no-show | 992 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 96231 | |
| C | 65456 | |
| c | 65456 | |
| h | 35673 | 6.3% |
| - | 35673 | 6.3% |
| u | 34681 | 6.1% |
| t | 34681 | 6.1% |
| O | 34681 | 6.1% |
| k | 34681 | 6.1% |
| a | 30775 | 5.4% |
| Other values (7) | 97285 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 565273 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 96231 | |
| C | 65456 | |
| c | 65456 | |
| h | 35673 | 6.3% |
| - | 35673 | 6.3% |
| u | 34681 | 6.1% |
| t | 34681 | 6.1% |
| O | 34681 | 6.1% |
| k | 34681 | 6.1% |
| a | 30775 | 5.4% |
| Other values (7) | 97285 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 565273 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 96231 | |
| C | 65456 | |
| c | 65456 | |
| h | 35673 | 6.3% |
| - | 35673 | 6.3% |
| u | 34681 | 6.1% |
| t | 34681 | 6.1% |
| O | 34681 | 6.1% |
| k | 34681 | 6.1% |
| a | 30775 | 5.4% |
| Other values (7) | 97285 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 565273 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 96231 | |
| C | 65456 | |
| c | 65456 | |
| h | 35673 | 6.3% |
| - | 35673 | 6.3% |
| u | 34681 | 6.1% |
| t | 34681 | 6.1% |
| O | 34681 | 6.1% |
| k | 34681 | 6.1% |
| a | 30775 | 5.4% |
| Other values (7) | 97285 |
| Distinct | 921 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 519.2 KiB |
| Minimum | 2014-11-18 00:00:00 |
|---|---|
| Maximum | 2017-09-14 00:00:00 |
Interactions
Correlations
| adr | adults | agent | arrival_date_day_of_month | arrival_date_month | arrival_date_week_number | arrival_date_year | assigned_room_type | babies | booking_changes | children | company | customer_type | days_in_waiting_list | deposit_type | distribution_channel | hotel | is_canceled | is_repeated_guest | lead_time | market_segment | meal | previous_bookings_not_canceled | previous_cancellations | required_car_parking_spaces | reservation_status | reserved_room_type | stays_in_week_nights | stays_in_weekend_nights | total_of_special_requests | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| adr | 1.000 | 0.307 | -0.001 | 0.025 | 0.000 | 0.098 | 0.000 | 0.000 | 0.000 | -0.011 | 0.000 | -0.014 | 0.000 | 0.016 | 0.008 | 0.000 | 0.000 | 0.000 | 0.000 | 0.077 | 0.000 | 0.000 | -0.143 | -0.082 | 0.000 | 0.000 | 0.000 | 0.142 | 0.096 | 0.131 |
| adults | 0.307 | 1.000 | -0.016 | 0.014 | 0.011 | 0.039 | 0.017 | 0.000 | 0.000 | -0.045 | 0.000 | 0.210 | 0.121 | -0.032 | 0.000 | 0.010 | 0.008 | 0.012 | 0.000 | 0.174 | 0.010 | 0.000 | -0.206 | -0.024 | 0.000 | 0.007 | 0.000 | 0.149 | 0.132 | 0.135 |
| agent | -0.001 | -0.016 | 1.000 | 0.009 | 0.116 | -0.073 | 0.191 | 0.145 | 0.032 | 0.147 | 0.074 | 0.525 | 0.176 | -0.102 | 0.206 | 0.199 | 0.821 | 0.345 | 0.077 | -0.119 | 0.283 | 0.218 | 0.069 | 0.043 | 0.146 | 0.246 | 0.153 | 0.157 | 0.152 | 0.215 |
| arrival_date_day_of_month | 0.025 | 0.014 | 0.009 | 1.000 | 0.067 | 0.071 | 0.057 | 0.014 | 0.004 | 0.009 | 0.011 | 0.082 | 0.033 | 0.033 | 0.075 | 0.039 | 0.044 | 0.030 | 0.014 | -0.008 | 0.044 | 0.047 | 0.007 | -0.021 | 0.012 | 0.030 | 0.014 | -0.020 | -0.004 | 0.011 |
| arrival_date_month | 0.000 | 0.011 | 0.116 | 0.067 | 1.000 | 0.798 | 0.440 | 0.041 | 0.028 | 0.012 | 0.075 | 0.294 | 0.121 | 0.082 | 0.130 | 0.075 | 0.151 | 0.086 | 0.096 | 0.136 | 0.106 | 0.109 | 0.017 | 0.046 | 0.023 | 0.082 | 0.061 | 0.051 | 0.068 | 0.071 |
| arrival_date_week_number | 0.098 | 0.039 | -0.073 | 0.071 | 0.798 | 1.000 | 0.447 | 0.044 | 0.025 | 0.006 | 0.067 | 0.017 | 0.121 | 0.029 | 0.122 | 0.071 | 0.145 | 0.080 | 0.099 | 0.126 | 0.098 | 0.100 | -0.058 | 0.045 | 0.020 | 0.076 | 0.057 | 0.033 | 0.025 | 0.024 |
| arrival_date_year | 0.000 | 0.017 | 0.191 | 0.057 | 0.440 | 0.447 | 1.000 | 0.075 | 0.007 | 0.022 | 0.047 | 0.318 | 0.168 | 0.087 | 0.071 | 0.036 | 0.182 | 0.204 | 0.076 | 0.141 | 0.121 | 0.105 | 0.036 | 0.059 | 0.017 | 0.144 | 0.106 | 0.033 | 0.054 | 0.089 |
| assigned_room_type | 0.000 | 0.000 | 0.145 | 0.014 | 0.041 | 0.044 | 0.075 | 1.000 | 0.057 | 0.075 | 0.321 | 0.088 | 0.084 | 0.038 | 0.224 | 0.109 | 0.403 | 0.295 | 0.087 | 0.063 | 0.141 | 0.107 | 0.012 | 0.016 | 0.101 | 0.210 | 0.784 | 0.053 | 0.078 | 0.085 |
| babies | 0.000 | 0.000 | 0.032 | 0.004 | 0.028 | 0.025 | 0.007 | 0.057 | 1.000 | 0.020 | 0.033 | 0.048 | 0.013 | 0.000 | 0.027 | 0.027 | 0.055 | 0.047 | 0.010 | 0.000 | 0.037 | 0.020 | 0.000 | 0.000 | 0.027 | 0.033 | 0.052 | 0.000 | 0.017 | 0.094 |
| booking_changes | -0.011 | -0.045 | 0.147 | 0.009 | 0.012 | 0.006 | 0.022 | 0.075 | 0.020 | 1.000 | 0.021 | 0.136 | 0.035 | -0.017 | 0.032 | 0.029 | 0.045 | 0.062 | 0.000 | 0.016 | 0.022 | 0.014 | 0.028 | -0.024 | 0.020 | 0.043 | 0.015 | 0.097 | 0.066 | 0.064 |
| children | 0.000 | 0.000 | 0.074 | 0.011 | 0.075 | 0.067 | 0.047 | 0.321 | 0.033 | 0.021 | 1.000 | 0.038 | 0.057 | 0.021 | 0.079 | 0.041 | 0.055 | 0.034 | 0.030 | 0.028 | 0.105 | 0.033 | 0.000 | 0.000 | 0.030 | 0.032 | 0.379 | 0.010 | 0.035 | 0.050 |
| company | -0.014 | 0.210 | 0.525 | 0.082 | 0.294 | 0.017 | 0.318 | 0.088 | 0.048 | 0.136 | 0.038 | 1.000 | 0.299 | -0.007 | 0.237 | 0.266 | 0.437 | 0.232 | 0.181 | 0.186 | 0.348 | 0.229 | -0.074 | -0.043 | 0.066 | 0.165 | 0.102 | 0.129 | 0.111 | 0.087 |
| customer_type | 0.000 | 0.121 | 0.176 | 0.033 | 0.121 | 0.121 | 0.168 | 0.084 | 0.013 | 0.035 | 0.057 | 0.299 | 1.000 | 0.105 | 0.152 | 0.091 | 0.065 | 0.234 | 0.148 | 0.071 | 0.294 | 0.134 | 0.033 | 0.005 | 0.046 | 0.166 | 0.108 | 0.107 | 0.126 | 0.106 |
| days_in_waiting_list | 0.016 | -0.032 | -0.102 | 0.033 | 0.082 | 0.029 | 0.087 | 0.038 | 0.000 | -0.017 | 0.021 | -0.007 | 0.105 | 1.000 | 0.121 | 0.031 | 0.177 | 0.050 | 0.024 | 0.170 | 0.089 | 0.064 | -0.032 | -0.025 | 0.039 | 0.039 | 0.034 | -0.001 | -0.091 | -0.128 |
| deposit_type | 0.008 | 0.000 | 0.206 | 0.075 | 0.130 | 0.122 | 0.071 | 0.224 | 0.027 | 0.032 | 0.079 | 0.237 | 0.152 | 0.121 | 1.000 | 0.096 | 0.335 | 0.408 | 0.065 | 0.291 | 0.369 | 0.084 | 0.021 | 0.063 | 0.088 | 0.297 | 0.173 | 0.064 | 0.100 | 0.209 |
| distribution_channel | 0.000 | 0.010 | 0.199 | 0.039 | 0.075 | 0.071 | 0.036 | 0.109 | 0.027 | 0.029 | 0.041 | 0.266 | 0.091 | 0.031 | 0.096 | 1.000 | 0.255 | 0.227 | 0.213 | 0.113 | 0.670 | 0.068 | 0.109 | 0.036 | 0.083 | 0.164 | 0.117 | 0.011 | 0.066 | 0.073 |
| hotel | 0.000 | 0.008 | 0.821 | 0.044 | 0.151 | 0.145 | 0.182 | 0.403 | 0.055 | 0.045 | 0.055 | 0.437 | 0.065 | 0.177 | 0.335 | 0.255 | 1.000 | 0.494 | 0.134 | 0.165 | 0.234 | 0.318 | 0.067 | 0.038 | 0.229 | 0.495 | 0.323 | 0.150 | 0.182 | 0.214 |
| is_canceled | 0.000 | 0.012 | 0.345 | 0.030 | 0.086 | 0.080 | 0.204 | 0.295 | 0.047 | 0.062 | 0.034 | 0.232 | 0.234 | 0.050 | 0.408 | 0.227 | 0.494 | 1.000 | 0.138 | 0.229 | 0.256 | 0.186 | 0.074 | 0.047 | 0.292 | 1.000 | 0.120 | 0.068 | 0.054 | 0.212 |
| is_repeated_guest | 0.000 | 0.000 | 0.077 | 0.014 | 0.096 | 0.099 | 0.076 | 0.087 | 0.010 | 0.000 | 0.030 | 0.181 | 0.148 | 0.024 | 0.065 | 0.213 | 0.134 | 0.138 | 1.000 | 0.120 | 0.264 | 0.053 | 0.315 | 0.067 | 0.085 | 0.138 | 0.030 | 0.019 | 0.077 | 0.064 |
| lead_time | 0.077 | 0.174 | -0.119 | -0.008 | 0.136 | 0.126 | 0.141 | 0.063 | 0.000 | 0.016 | 0.028 | 0.186 | 0.071 | 0.170 | 0.291 | 0.113 | 0.165 | 0.229 | 0.120 | 1.000 | 0.177 | 0.086 | -0.188 | 0.075 | 0.069 | 0.172 | 0.048 | 0.358 | 0.209 | -0.082 |
| market_segment | 0.000 | 0.010 | 0.283 | 0.044 | 0.106 | 0.098 | 0.121 | 0.141 | 0.037 | 0.022 | 0.105 | 0.348 | 0.294 | 0.089 | 0.369 | 0.670 | 0.234 | 0.256 | 0.264 | 0.177 | 1.000 | 0.177 | 0.093 | 0.038 | 0.106 | 0.190 | 0.154 | 0.047 | 0.079 | 0.204 |
| meal | 0.000 | 0.000 | 0.218 | 0.047 | 0.109 | 0.100 | 0.105 | 0.107 | 0.020 | 0.014 | 0.033 | 0.229 | 0.134 | 0.064 | 0.084 | 0.068 | 0.318 | 0.186 | 0.053 | 0.086 | 0.177 | 1.000 | 0.016 | 0.088 | 0.031 | 0.135 | 0.090 | 0.048 | 0.077 | 0.045 |
| previous_bookings_not_canceled | -0.143 | -0.206 | 0.069 | 0.007 | 0.017 | -0.058 | 0.036 | 0.012 | 0.000 | 0.028 | 0.000 | -0.074 | 0.033 | -0.032 | 0.021 | 0.109 | 0.067 | 0.074 | 0.315 | -0.188 | 0.093 | 0.016 | 1.000 | 0.125 | 0.026 | 0.052 | 0.006 | -0.114 | -0.090 | 0.025 |
| previous_cancellations | -0.082 | -0.024 | 0.043 | -0.021 | 0.046 | 0.045 | 0.059 | 0.016 | 0.000 | -0.024 | 0.000 | -0.043 | 0.005 | -0.025 | 0.063 | 0.036 | 0.038 | 0.047 | 0.067 | 0.075 | 0.038 | 0.088 | 0.125 | 1.000 | 0.000 | 0.034 | 0.011 | 0.008 | 0.007 | -0.031 |
| required_car_parking_spaces | 0.000 | 0.000 | 0.146 | 0.012 | 0.023 | 0.020 | 0.017 | 0.101 | 0.027 | 0.020 | 0.030 | 0.066 | 0.046 | 0.039 | 0.088 | 0.083 | 0.229 | 0.292 | 0.085 | 0.069 | 0.106 | 0.031 | 0.026 | 0.000 | 1.000 | 0.206 | 0.083 | 0.022 | 0.021 | 0.060 |
| reservation_status | 0.000 | 0.007 | 0.246 | 0.030 | 0.082 | 0.076 | 0.144 | 0.210 | 0.033 | 0.043 | 0.032 | 0.165 | 0.166 | 0.039 | 0.297 | 0.164 | 0.495 | 1.000 | 0.138 | 0.172 | 0.190 | 0.135 | 0.052 | 0.034 | 0.206 | 1.000 | 0.085 | 0.052 | 0.041 | 0.152 |
| reserved_room_type | 0.000 | 0.000 | 0.153 | 0.014 | 0.061 | 0.057 | 0.106 | 0.784 | 0.052 | 0.015 | 0.379 | 0.102 | 0.108 | 0.034 | 0.173 | 0.117 | 0.323 | 0.120 | 0.030 | 0.048 | 0.154 | 0.090 | 0.006 | 0.011 | 0.083 | 0.085 | 1.000 | 0.047 | 0.064 | 0.089 |
| stays_in_week_nights | 0.142 | 0.149 | 0.157 | -0.020 | 0.051 | 0.033 | 0.033 | 0.053 | 0.000 | 0.097 | 0.010 | 0.129 | 0.107 | -0.001 | 0.064 | 0.011 | 0.150 | 0.068 | 0.019 | 0.358 | 0.047 | 0.048 | -0.114 | 0.008 | 0.022 | 0.052 | 0.047 | 1.000 | 0.390 | 0.105 |
| stays_in_weekend_nights | 0.096 | 0.132 | 0.152 | -0.004 | 0.068 | 0.025 | 0.054 | 0.078 | 0.017 | 0.066 | 0.035 | 0.111 | 0.126 | -0.091 | 0.100 | 0.066 | 0.182 | 0.054 | 0.077 | 0.209 | 0.079 | 0.077 | -0.090 | 0.007 | 0.021 | 0.041 | 0.064 | 0.390 | 1.000 | 0.110 |
| total_of_special_requests | 0.131 | 0.135 | 0.215 | 0.011 | 0.071 | 0.024 | 0.089 | 0.085 | 0.094 | 0.064 | 0.050 | 0.087 | 0.106 | -0.128 | 0.209 | 0.073 | 0.214 | 0.212 | 0.064 | -0.082 | 0.204 | 0.045 | 0.025 | -0.031 | 0.060 | 0.152 | 0.089 | 0.105 | 0.110 | 1.000 |
Missing values
Sample
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Resort Hotel | 0 | 342 | 2015 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 3 | No Deposit | NaN | NaN | 0 | Transient | 0.0 | 0 | 0 | Check-Out | 2015-07-01 |
| 1 | Resort Hotel | 0 | 737 | 2015 | July | 27 | 1 | 0 | 0 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 4 | No Deposit | NaN | NaN | 0 | Transient | 0.0 | 0 | 0 | Check-Out | 2015-07-01 |
| 2 | Resort Hotel | 0 | 7 | 2015 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | 0 | BB | GBR | Direct | Direct | 0 | 0 | 0 | A | C | 0 | No Deposit | NaN | NaN | 0 | Transient | 75.0 | 0 | 0 | Check-Out | 2015-07-02 |
| 3 | Resort Hotel | 0 | 13 | 2015 | July | 27 | 1 | 0 | 1 | 1 | 0.0 | 0 | BB | GBR | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | No Deposit | 304.0 | NaN | 0 | Transient | 75.0 | 0 | 0 | Check-Out | 2015-07-02 |
| 4 | Resort Hotel | 0 | 14 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0 | Transient | 98.0 | 0 | 1 | Check-Out | 2015-07-03 |
| 5 | Resort Hotel | 0 | 14 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0 | Transient | 98.0 | 0 | 1 | Check-Out | 2015-07-03 |
| 6 | Resort Hotel | 0 | 0 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | BB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | NaN | NaN | 0 | Transient | 107.0 | 0 | 0 | Check-Out | 2015-07-03 |
| 7 | Resort Hotel | 0 | 9 | 2015 | July | 27 | 1 | 0 | 2 | 2 | 0.0 | 0 | FB | PRT | Direct | Direct | 0 | 0 | 0 | C | C | 0 | No Deposit | 303.0 | NaN | 0 | Transient | 103.0 | 0 | 1 | Check-Out | 2015-07-03 |
| 8 | Resort Hotel | 1 | 85 | 2015 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Online TA | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 240.0 | NaN | 0 | Transient | 82.0 | 0 | 1 | Canceled | 2015-05-06 |
| 9 | Resort Hotel | 1 | 75 | 2015 | July | 27 | 1 | 0 | 3 | 2 | 0.0 | 0 | HB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 15.0 | NaN | 0 | Transient | 105.5 | 0 | 0 | Canceled | 2015-04-22 |
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 66438 | City Hotel | 1 | 98 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | GBR | Online TA | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 136.8 | 0 | 2 | Canceled | 2017-01-29 |
| 66439 | City Hotel | 1 | 67 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | ISR | Online TA | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 135.0 | 0 | 0 | Canceled | 2017-02-16 |
| 66440 | City Hotel | 1 | 168 | 2017 | April | 16 | 20 | 0 | 3 | 3 | 0.0 | 0 | BB | BRA | Online TA | TA/TO | 0 | 0 | 0 | D | D | 0 | No Deposit | 9.0 | NaN | 0 | Transient | 166.5 | 0 | 3 | Canceled | 2017-03-06 |
| 66441 | City Hotel | 1 | 125 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 0 | Transient | 85.0 | 0 | 0 | Canceled | 2016-12-16 |
| 66442 | City Hotel | 1 | 125 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 0 | Transient | 85.0 | 0 | 0 | Canceled | 2016-12-16 |
| 66443 | City Hotel | 1 | 125 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 0 | Transient | 85.0 | 0 | 0 | Canceled | 2016-12-16 |
| 66444 | City Hotel | 1 | 125 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 0 | Transient | 85.0 | 0 | 0 | Canceled | 2016-12-16 |
| 66445 | City Hotel | 1 | 143 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 18 | Transient | 85.0 | 0 | 0 | Canceled | 2016-12-16 |
| 66446 | City Hotel | 1 | 125 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 0 | Transient | 85.0 | 0 | 0 | Canceled | 2016-12-16 |
| 66447 | City Hotel | 1 | 125 | 2017 | April | 16 | 20 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 33.0 | NaN | 0 | Transient | 85.0 | 0 | 0 | Canceled | NaN |
Duplicate rows
Most frequently occurring
| hotel | is_canceled | lead_time | arrival_date_year | arrival_date_month | arrival_date_week_number | arrival_date_day_of_month | stays_in_weekend_nights | stays_in_week_nights | adults | children | babies | meal | country | market_segment | distribution_channel | is_repeated_guest | previous_cancellations | previous_bookings_not_canceled | reserved_room_type | assigned_room_type | booking_changes | deposit_type | agent | company | days_in_waiting_list | customer_type | adr | required_car_parking_spaces | total_of_special_requests | reservation_status | reservation_status_date | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1871 | City Hotel | 1 | 277 | 2016 | November | 46 | 7 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | NaN | 0 | Transient | 100.0 | 0 | 0 | Canceled | 2016-04-04 | 180 |
| 1691 | City Hotel | 1 | 188 | 2016 | June | 25 | 15 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 119.0 | NaN | 39 | Transient | 130.0 | 0 | 0 | Canceled | 2016-01-18 | 109 |
| 1585 | City Hotel | 1 | 158 | 2016 | May | 22 | 24 | 0 | 2 | 1 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 37.0 | NaN | 31 | Transient | 130.0 | 0 | 0 | Canceled | 2016-01-18 | 101 |
| 815 | City Hotel | 1 | 28 | 2017 | March | 9 | 2 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Groups | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | NaN | 0 | Transient | 95.0 | 0 | 0 | Canceled | 2017-02-02 | 99 |
| 903 | City Hotel | 1 | 38 | 2017 | January | 2 | 14 | 0 | 1 | 1 | 0.0 | 0 | BB | PRT | Corporate | Corporate | 0 | 0 | 0 | A | A | 0 | Non Refund | NaN | 67.0 | 0 | Transient | 75.0 | 0 | 0 | Canceled | 2016-12-07 | 99 |
| 1138 | City Hotel | 1 | 71 | 2016 | June | 25 | 14 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | NaN | 0 | Transient | 120.0 | 0 | 0 | Canceled | 2016-04-27 | 89 |
| 1620 | City Hotel | 1 | 166 | 2016 | November | 45 | 1 | 0 | 3 | 1 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 236.0 | NaN | 0 | Transient | 110.0 | 0 | 0 | Canceled | 2016-07-13 | 85 |
| 1919 | City Hotel | 1 | 304 | 2016 | November | 45 | 3 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 21.0 | NaN | 0 | Transient | 89.0 | 0 | 0 | Canceled | 2016-02-01 | 85 |
| 1920 | City Hotel | 1 | 305 | 2016 | November | 45 | 4 | 1 | 2 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | Non Refund | 21.0 | NaN | 0 | Transient | 89.0 | 0 | 0 | Canceled | 2016-02-01 | 85 |
| 892 | City Hotel | 1 | 37 | 2016 | October | 42 | 13 | 0 | 3 | 2 | 0.0 | 0 | BB | PRT | Offline TA/TO | TA/TO | 0 | 0 | 0 | A | A | 0 | No Deposit | 56.0 | NaN | 0 | Transient-Party | 105.0 | 0 | 0 | Canceled | 2016-09-06 | 84 |